Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
1.
Clin Cancer Res ; 29(7): 1220-1231, 2023 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-36815791

RESUMO

PURPOSE: Patients with resected localized clear-cell renal cell carcinoma (ccRCC) remain at variable risk of recurrence. Incorporation of biomarkers may refine risk prediction and inform adjuvant treatment decisions. We explored the role of tumor genomics in this setting, leveraging the largest cohort to date of localized ccRCC tissues subjected to targeted gene sequencing. EXPERIMENTAL DESIGN: The somatic mutation status of 12 genes was determined in 943 ccRCC cases from a multinational cohort of patients, and associations to outcomes were examined in a Discovery (n = 469) and Validation (n = 474) framework. RESULTS: Tumors containing a von-Hippel Lindau (VHL) mutation alone were associated with significantly improved outcomes in comparison with tumors containing a VHL plus additional mutations. Within the Discovery cohort, those with VHL+0, VHL+1, VHL+2, and VHL+≥3 tumors had disease-free survival (DFS) rates of 90.8%, 80.1%, 68.2%, and 50.7% respectively, at 5 years. This trend was replicated in the Validation cohort. Notably, these genomically defined groups were independent of tumor mutational burden. Amongst patients eligible for adjuvant therapy, those with a VHL+0 tumor (29%) had a 5-year DFS rate of 79.3% and could, therefore, potentially be spared further treatment. Conversely, patients with VHL+2 and VHL+≥3 tumors (32%) had equivalent DFS rates of 45.6% and 35.3%, respectively, and should be prioritized for adjuvant therapy. CONCLUSIONS: Genomic characterization of ccRCC identified biologically distinct groups of patients with divergent relapse rates. These groups account for the ∼80% of cases with VHL mutations and could be used to personalize adjuvant treatment discussions with patients as well as inform future adjuvant trial design.


Assuntos
Carcinoma de Células Renais , Neoplasias Renais , Humanos , Carcinoma de Células Renais/genética , Carcinoma de Células Renais/terapia , Carcinoma de Células Renais/metabolismo , Neoplasias Renais/genética , Neoplasias Renais/terapia , Neoplasias Renais/metabolismo , Proteína Supressora de Tumor Von Hippel-Lindau/genética , Recidiva Local de Neoplasia/genética , Mutação
2.
J Bioinform Comput Biol ; 20(5): 2250021, 2022 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-36102744

RESUMO

We present hybrid system-based gene regulatory network models for lambda, HK022, and Mu bacteriophages together with dynamics analysis of the modeled networks. The proposed lambda phage model LPH2 is based on an earlier work and incorporates more recent biological assumptions about the underlying gene regulatory mechanism, HK022, and Mu phage models are new. All three models provide accurate representations of experimentally observed lytic and lysogenic behavioral cycles. Importantly, the models also imply that lysis and lysogeny are the only stable behaviors that can occur in the modeled networks. In addition, the models allow to derive switching conditions that irrevocably lead to either lytic or lysogenic behavioral cycle as well as constraints that are required for their biological feasibility. For LPH2 model the feasibility constraints place two mutually independent requirements on comparative order of cro and cI protein binding site affinities. However, HK022 model, while broadly similar, does not require any of these constraints. Biologically very different lysis-lysogeny switching mechanism of Mu phage is also accurately reproduced by its model. In general the results show that hybrid system model (HSM) hybrid system framework can be successfully applied to modeling small ([Formula: see text] gene) regulatory networks and used for comprehensive analysis of model dynamics and stable behavior regions.


Assuntos
Redes Reguladoras de Genes , Lisogenia , Bacteriófago lambda/genética , Ligação Proteica , Sítios de Ligação
3.
Virchows Arch ; 478(6): 1099-1107, 2021 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-33403511

RESUMO

There are unexplained geographical variations in the incidence of kidney cancer with the high rates reported in Baltic countries, as well as eastern and central Europe. Having access to a large and well-annotated collection of "tumor/non-tumor" pairs of kidney cancer patients from the Czech Republic, Romania, Serbia, UK, and Russia, we aimed to analyze the morphology of non-neoplastic renal tissue in nephrectomy specimens. By applying digital pathology, we performed a microscopic examination of 1012 frozen non-neoplastic kidney tissues from patients with renal cell carcinoma. Four components of renal parenchyma were evaluated and scored for the intensity of interstitial inflammation and fibrosis, tubular atrophy, glomerulosclerosis, and arterial wall thickening, globally called chronic renal parenchymal changes. Moderate or severe changes were observed in 54 (5.3%) of patients with predominance of occurrence in Romania (OR = 2.67, CI 1.07-6.67) and Serbia (OR = 4.37, CI 1.20-15.96) in reference to those from Russia. Further adjustment for comorbidities, tumor characteristics, and stage did not change risk estimates. In multinomial regression model, relative probability of non-glomerular changes was 5.22 times higher for Romania and Serbia compared to Russia. Our findings show that the frequency of chronic renal parenchymal changes, with the predominance of chronic interstitial nephritis pattern, in kidney cancer patients varies by country, significantly more frequent in countries located in central and southeastern Europe where the incidence of kidney cancer has been reported to be moderate to high. The observed association between these pathological features and living in certain geographic areas requires a larger population-based study to confirm this association on a large scale.


Assuntos
Carcinoma de Células Renais/patologia , Fibrose/patologia , Neoplasias Renais/patologia , Rim/patologia , Adulto , Idoso , Europa (Continente) , Feminino , Taxa de Filtração Glomerular/fisiologia , Humanos , Masculino , Pessoa de Meia-Idade , Nefrectomia/métodos , Federação Russa
4.
Proteomics ; 20(21-22): e2000009, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32937025

RESUMO

Mass spectrometry (MS)-based quantitative proteomics experiments typically assay a subset of up to 60% of the ≈20 000 human protein coding genes. Computational methods for imputing the missing values using RNA expression data usually allow only for imputations of proteins measured in at least some of the samples. In silico methods for comprehensively estimating abundances across all proteins are still missing. Here, a novel method is proposed using deep learning to extrapolate the observed protein expression values in label-free MS experiments to all proteins, leveraging gene functional annotations and RNA measurements as key predictive attributes. This method is tested on four datasets, including human cell lines and human and mouse tissues. This method predicts the protein expression values with average R2 scores between 0.46 and 0.54, which is significantly better than predictions based on correlations using the RNA expression data alone. Moreover, it is demonstrated that the derived models can be "transferred" across experiments and species. For instance, the model derived from human tissues gave a R2=0.51 when applied to mouse tissue data. It is concluded that protein abundances generated in label-free MS experiments can be computationally predicted using functional annotated attributes and can be used to highlight aberrant protein abundance values.


Assuntos
Aprendizado Profundo , Animais , Espectrometria de Massas , Camundongos , Anotação de Sequência Molecular , Proteínas , Proteômica
5.
J Bioinform Comput Biol ; 18(3): 2040008, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32698721

RESUMO

Current high-throughput experimental techniques make it feasible to infer gene regulatory interactions at the whole-genome level with reasonably good accuracy. Such experimentally inferred regulatory networks have become available for a number of simpler model organisms such as S. cerevisiae, and others. The availability of such networks provides an opportunity to compare gene regulatory processes at the whole genome level, and in particular, to assess similarity of regulatory interactions for homologous gene pairs either from the same or from different species. We present here a new technique for analyzing the regulatory interaction neighborhoods of paralogous gene pairs. Our central focus is the analysis of S. cerevisiae gene interaction graphs, which are of particular interest due to the ancestral whole-genome duplication (WGD) that allows to distinguish between paralogous transcription factors that are traceable to this duplication event and other paralogues. Similar analysis is also applied to E. coli and C. elegans networks. We compare paralogous gene pairs according to the presence and size of bi-fan arrays, classically associated in the literature with gene duplication, within other network motifs. We further extend this framework beyond transcription factor comparison to obtain topology-based similarity metrics based on the overlap of interaction neighborhoods applicable to most genes in a given organism. We observe that our network divergence metrics show considerably larger similarity between paralogues, especially those traceable to WGD. This is the case for both yeast and C. elegans, but not for E. coli regulatory network. While there is no obvious cross-species link between metrics, different classes of paralogues show notable differences in interaction overlap, with traceable duplications tending toward higher overlap compared to genes with shared protein families. Our findings indicate that divergence in paralogous interaction networks reflects a shared genetic origin, and that our approach may be useful for investigating structural similarity in the interaction networks of paralogous genes.


Assuntos
Caenorhabditis elegans/genética , Biologia Computacional/métodos , Escherichia coli/genética , Redes Reguladoras de Genes , Saccharomyces cerevisiae/genética , Animais , Evolução Molecular , Duplicação Gênica , Genoma , Fatores de Transcrição/genética
6.
BMC Bioinformatics ; 20(Suppl 23): 618, 2019 Dec 27.
Artigo em Inglês | MEDLINE | ID: mdl-31881819

RESUMO

BACKGROUND: Current Hi-C technologies for chromosome conformation capture allow to understand a broad spectrum of functional interactions between genome elements. Although significant progress has been made into analysis of Hi-C data to identify biologically significant features, many questions still remain open, in particular regarding potential biological significance of various topological features that are characteristic for chromatin interaction networks. RESULTS: It has been previously observed that promoter capture Hi-C (PCHi-C) interaction networks tend to separate easily into well-defined connected components that can be related to certain biological functionality, however, such evidence was based on manual analysis and was limited. Here we present a novel method for analysis of chromatin interaction networks aimed towards identifying characteristic topological features of interaction graphs and confirming their potential significance in chromatin architecture. Our method automatically identifies all connected components with an assigned significance score above a given threshold. These components can be subjected afterwards to different assessment methods for their biological role and/or significance. The method was applied to the largest PCHi-C data set available to date that contains interactions for 17 haematopoietic cell types. The results demonstrate strong evidence of well-pronounced component structure of chromatin interaction networks and provide some characterisation of this component structure. We also performed an indicative assessment of potential biological significance of identified network components with the results confirming that the network components can be related to specific biological functionality. CONCLUSIONS: The obtained results show that the topological structure of chromatin interaction networks can be well described in terms of isolated connected components of the network and that formation of these components can be often explained by biological features of functionally related gene modules. The presented method allows automatic identification of all such components and evaluation of their significance in PCHi-C dataset for 17 haematopoietic cell types. The method can be adapted for exploration of other chromatin interaction data sets that include information about sufficiently large number of different cell types, and, in principle, also for analysis of other kinds of cell type-specific networks.


Assuntos
Cromatina/química , Redes Reguladoras de Genes , Algoritmos , Regulação da Expressão Gênica , Hematopoese/genética , Humanos , Regiões Promotoras Genéticas
7.
BMC Bioinformatics ; 20(1): 296, 2019 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-31151381

RESUMO

BACKGROUND: Gene regulatory networks can be modelled in various ways depending on the level of detail required and biological questions addressed. One of the earliest formalisms used for modeling is a Boolean network, although these models cannot describe most temporal aspects of a biological system. Differential equation models have also been used to model gene regulatory networks, but these frameworks tend to be too detailed for large models and many quantitative parameters might not be deducible in practice. Hybrid models bridge the gap between these two model classes - these are useful when concentration changes are important while the information about precise concentrations and binding site affinities is partial. RESULTS: In this paper we study the stable behaviours of phage λ via a hybrid system based model. We identify wild type and mutant behaviours that arise for various orderings of binding site affinities. We propose experiments for detecting these behaviours: we suggest several ways of altering binding affinities with either mutations or genome rearrangements to achieve modified behaviours. The feasibility of these experiments is assessed. The interplay between the qualitative aspects of a network, e.g. network topology, and quantitative parameters, e.g. growth and degradation rates of proteins, is demonstrated. We also provide a software for exploring all feasible states of a hybrid system model and identifying all attractors. CONCLUSIONS: The behaviours of phage λ are determined mainly by the topology of this network and by the mutual order of binding affinities. Exact affinities and growth and degradation rates of proteins fine tune the system. We show that only two stable behaviours are possible for phage λ if the main constraints of λ switch are preserved - these behaviours correspond to lysis and lysogeny. We identify several variants of both lysis and lysogeny - one wild type and one modified behaviour for each. We elucidate the necessary constraints for binding site affinities to achieve both wild type lysis and lysogeny. Our software is applicable to a wide range of biological models described as a hybrid system.


Assuntos
Bacteriófago lambda/genética , Regulação Viral da Expressão Gênica , Redes Reguladoras de Genes , Bacteriófago lambda/fisiologia , Lisogenia , Modelos Biológicos , Mutação , Óperon , Software
8.
Nat Commun ; 5: 5135, 2014 Oct 29.
Artigo em Inglês | MEDLINE | ID: mdl-25351205

RESUMO

The incidence of renal cell carcinoma (RCC) is increasing worldwide, and its prevalence is particularly high in some parts of Central Europe. Here we undertake whole-genome and transcriptome sequencing of clear cell RCC (ccRCC), the most common form of the disease, in patients from four different European countries with contrasting disease incidence to explore the underlying genomic architecture of RCC. Our findings support previous reports on frequent aberrations in the epigenetic machinery and PI3K/mTOR signalling, and uncover novel pathways and genes affected by recurrent mutations and abnormal transcriptome patterns including focal adhesion, components of extracellular matrix (ECM) and genes encoding FAT cadherins. Furthermore, a large majority of patients from Romania have an unexpected high frequency of A:T>T:A transversions, consistent with exposure to aristolochic acid (AA). These results show that the processes underlying ccRCC tumorigenesis may vary in different populations and suggest that AA may be an important ccRCC carcinogen in Romania, a finding with major public health implications.


Assuntos
Carcinoma de Células Renais/genética , Variação Genética , Genoma Humano/genética , Genômica , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Coortes , Europa (Continente) , Feminino , Adesões Focais/metabolismo , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Mutação , Taxa de Mutação , Proteínas de Fusão Oncogênica/genética , Fosfatidilinositol 3-Quinases/genética , Splicing de RNA/genética , Análise de Sequência de DNA , Transdução de Sinais/genética
9.
Gene ; 518(1): 70-7, 2013 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-23266641

RESUMO

The paper proposes a hybrid system based approach for modelling of intracellular networks and introduces a restricted subclass of hybrid systems - HSM - with an objective of still being able to provide sufficient power for the modelling of biological systems, while imposing some restrictions that facilitate analysis of systems described by such models. The use of hybrid system based models has become increasingly popular, likely due to the facts that: 1) they provide sufficiently powerful mathematical formalism to describe biological processes of interest and do it in a 'natural way' from the biological perspective; 2) there are well established mathematical techniques as well as supporting software tools for analysing such models. However often these models are very dependent on the quantitative parameters of the system (concentrations of proteins, their growth functions etc.) that are seldom exactly known, instead of more limited information of the system that can be observed in practice (directions of change in concentrations, but not the exact values etc.). As a result these models may work well for simulation of the system (prediction of its state starting from some initial conditions), but are too complicated for prediction of all possible qualitatively different behaviours a modelled system might have. With HSM we try to propose a hybrid system based formalism that is still sufficiently powerful for description of biological systems, while being as restricted as possible to facilitate the analysis of the systems described. We separate between the quantitative system parameters and their qualitative values that can be observed in practice. For HSM we provide an algorithm that analyses the system without the need to know the exact parameter values. We apply our model and analysis methods to a well-studied gene network of λ-phage. The phage has two well-known qualitatively different behaviours - lysis and lysogeny. We show that our model has an attractor structure that corresponds well to these two behaviours and that these are the only stable behaviours that can be exhibited by the system. The algorithm also generates (in principle biologically verifiable) hypotheses about the mutations of λ-phage that should change its observable behaviour.


Assuntos
Algoritmos , Bacteriófago lambda/genética , Redes Reguladoras de Genes , Modelos Biológicos , Bacteriófago lambda/fisiologia , Genes Virais , Lisogenia , Mutação , Regiões Promotoras Genéticas
10.
PLoS Genet ; 7(9): e1002270, 2011 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-21931564

RESUMO

We have performed a metabolite quantitative trait locus (mQTL) study of the (1)H nuclear magnetic resonance spectroscopy ((1)H NMR) metabolome in humans, building on recent targeted knowledge of genetic drivers of metabolic regulation. Urine and plasma samples were collected from two cohorts of individuals of European descent, with one cohort comprised of female twins donating samples longitudinally. Sample metabolite concentrations were quantified by (1)H NMR and tested for association with genome-wide single-nucleotide polymorphisms (SNPs). Four metabolites' concentrations exhibited significant, replicable association with SNP variation (8.6×10(-11)

Assuntos
Estudo de Associação Genômica Ampla , Redes e Vias Metabólicas/genética , Metaboloma/genética , Locos de Características Quantitativas/genética , Seleção Genética , Acetiltransferases/genética , Acetiltransferases/metabolismo , Dimetilaminas/sangue , Dimetilaminas/metabolismo , Feminino , Haplótipos , Humanos , Isobutiratos/metabolismo , Isobutiratos/urina , Espectroscopia de Ressonância Magnética , Metilaminas/metabolismo , Metilaminas/urina , Polimorfismo de Nucleotídeo Único
11.
Mol Syst Biol ; 7: 525, 2011 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-21878913

RESUMO

¹H Nuclear Magnetic Resonance spectroscopy (¹H NMR) is increasingly used to measure metabolite concentrations in sets of biological samples for top-down systems biology and molecular epidemiology. For such purposes, knowledge of the sources of human variation in metabolite concentrations is valuable, but currently sparse. We conducted and analysed a study to create such a resource. In our unique design, identical and non-identical twin pairs donated plasma and urine samples longitudinally. We acquired ¹H NMR spectra on the samples, and statistically decomposed variation in metabolite concentration into familial (genetic and common-environmental), individual-environmental, and longitudinally unstable components. We estimate that stable variation, comprising familial and individual-environmental factors, accounts on average for 60% (plasma) and 47% (urine) of biological variation in ¹H NMR-detectable metabolite concentrations. Clinically predictive metabolic variation is likely nested within this stable component, so our results have implications for the effective design of biomarker-discovery studies. We provide a power-calculation method which reveals that sample sizes of a few thousand should offer sufficient statistical precision to detect ¹H NMR-based biomarkers quantifying predisposition to disease.


Assuntos
Biomarcadores , Interação Gene-Ambiente , Metaboloma/genética , Ressonância Magnética Nuclear Biomolecular/métodos , Biologia de Sistemas/métodos , População Branca/genética , Idoso , Algoritmos , Biomarcadores/sangue , Biomarcadores/urina , Bases de Dados Genéticas , Feminino , Variação Genética , Humanos , Pessoa de Meia-Idade , Modelos Estatísticos , Projetos de Pesquisa , Tamanho da Amostra , Gêmeos Dizigóticos/genética , Gêmeos Monozigóticos/genética
12.
Bioinformatics ; 25(20): 2768-9, 2009 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-19633095

RESUMO

UNLABELLED: SIMBioMS is a web-based open source software system for managing data and information in biomedical studies. It provides a solution for the collection, storage, management and retrieval of information about research subjects and biomedical samples, as well as experimental data obtained using a range of high-throughput technologies, including gene expression, genotyping, proteomics and metabonomics. The system can easily be customized and has proven to be successful in several large-scale multi-site collaborative projects. It is compatible with emerging functional genomics data standards and provides data import and export in accepted standard formats. Protocols for transferring data to durable archives at the European Bioinformatics Institute have been implemented. AVAILABILITY: The source code, documentation and initialization scripts are available at http://simbioms.org.


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Gestão da Informação/métodos , Armazenamento e Recuperação da Informação/métodos , Software , Bases de Dados Factuais
13.
BMC Bioinformatics ; 8: 52, 2007 Feb 09.
Artigo em Inglês | MEDLINE | ID: mdl-17291344

RESUMO

BACKGROUND: One of the crucial aspects of day-to-day laboratory information management is collection, storage and retrieval of information about research subjects and biomedical samples. An efficient link between sample data and experiment results is absolutely imperative for a successful outcome of a biomedical study. Currently available software solutions are largely limited to large-scale, expensive commercial Laboratory Information Management Systems (LIMS). Acquiring such LIMS indeed can bring laboratory information management to a higher level, but often implies sufficient investment of time, effort and funds, which are not always available. There is a clear need for lightweight open source systems for patient and sample information management. RESULTS: We present a web-based tool for submission, management and retrieval of sample and research subject data. The system secures confidentiality by separating anonymized sample information from individuals' records. It is simple and generic, and can be customised for various biomedical studies. Information can be both entered and accessed using the same web interface. User groups and their privileges can be defined. The system is open-source and is supplied with an on-line tutorial and necessary documentation. It has proven to be successful in a large international collaborative project. CONCLUSION: The presented system closes the gap between the need and the availability of lightweight software solutions for managing information in biomedical studies involving human research subjects.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Sistemas Computadorizados de Registros Médicos , Software , Interface Usuário-Computador , Inteligência Artificial , Engenharia Biomédica/métodos , Pesquisa Biomédica/métodos , Ensaios Clínicos como Assunto/métodos , Linguagens de Programação
14.
Bioinformatics ; 23(7): 832-41, 2007 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-17282999

RESUMO

MOTIVATION: The evolution of protein sequences can be described by a stepwise process, where each step involves changes of a few amino acids. In a similar manner, the evolution of protein folds can be at least partially described by an analogous process, where each step involves comparatively simple changes affecting few secondary structure elements. A number of such evolution steps, justified by biologically confirmed examples, have previously been proposed by other researchers. However, unlike the situation with sequences, as far as we know there have been no attempts to estimate the comparative probabilities for different kinds of such structural changes. RESULTS: We have tried to assess the comparative probabilities for a number of known structural changes, and to relate the probabilities of such changes with the distance between protein sequences. We have formalized these structural changes using a topological representation of structures (TOPS), and have developed an algorithm for measuring structural distances that involve few evolutionary steps. The probabilities of structural changes then were estimated on the basis of all-against-all comparisons of the sequence and structure of protein domains from the CATH-95 representative set. The results obtained are reasonably consistent for a number of different data subsets and permit the identification of several 'most popular' types of evolutionary changes in protein structure. The results also suggest that alterations in protein structure are more likely to occur when the sequence similarity is >10% (the average similarity being approximately 6% for the data sets employed in this study), and that the distribution of probabilities of structural changes is fairly uniform within the interval of 15-50% sequence similarity. AVAILABILITY: The algorithms have been implemented on the Windows operating system in C++ and using the Borland Visual Component Library. The source code is available on request from the first author. The data sets used for this study (representative sets of protein domains, matrices of sequence similarities and structural distances) are available on http://bioinf.mii.lu.lv/epsrc_project/struct_ev.html.


Assuntos
Evolução Molecular , Modelos Químicos , Modelos Genéticos , Proteínas/química , Proteínas/genética , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Simulação por Computador , Sequência Conservada , Interpretação Estatística de Dados , Modelos Moleculares , Dobramento de Proteína , Homologia de Sequência de Aminoácidos
15.
Genome Inform ; 16(2): 225-36, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-16901105

RESUMO

We study the Finite State Linear Model (FSLM) for modelling gene regulatory networks proposed by A. Brazma and T. Schlitt in [4]. The model incorporates biologically intuitive gene regulatory mechanism similar to that in Boolean networks, and can describe also the continuous changes in protein levels. We consider several theoretical properties of this model; in particular we show that the problem whether a particular gene will reach an active state is algorithmically unsolvable. This imposes some practical difficulties in simulation and reverse engineering of FSLM networks. Nevertheless, our simulation experiments show that sufficiently many of FSLM networks exhibit a regular behaviour and that the model is still quite adequate to describe biological reality. We also propose a comparatively efficient O(2(K)n(K+1)M(2K)m log m) time algorithm for reconstruction of FSLM networks from experimental data. Experiments on reconstruction of random networks are performed to estimate the running time of the algorithm in practice, as well as the number of measurements needed for successful network reconstruction.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Regulação da Expressão Gênica/genética , Modelos Lineares , Modelos Genéticos , Algoritmos , Animais , Engenharia de Proteínas/métodos
16.
Genome Inform ; 15(2): 72-81, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-15706493

RESUMO

We describe a method for automated domain discovery for topological profile searches in protein structures. The method is used in a system TOPStructure for fast prediction of CATH classification for protein structures (given as PDB files). It is important for profile searches in multi-domain proteins, for which the profile method by itself tends to perform poorly. We also present an O(C(n)k + nk(2)) time algorithm for this problem, compared to the O(C(n)k + (nk)(2)) time used by a trivial algorithm (where n is the length of the structure, k is the number of profiles and C(n) is the time needed to check for a presence of a given motif in a structure of length n). This method has been developed and is currently used for TOPS representations of protein structures and prediction of CATH classification, but may be applied to other graph-based representations of protein or RNA structures and/or other prediction problems. A protein structure prediction system incorporating the domain discovery method is available at http://bioinf.mii.lu.lv/tops/.


Assuntos
Algoritmos , Inteligência Artificial , Estrutura Terciária de Proteína , Proteínas/química , Estrutura Molecular , Reconhecimento Automatizado de Padrão , Homologia de Sequência , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...